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Tlta  information  and  data  contained  herein  have  been 
compiled  from  government  and  nongovernment  technical 
reports  and  are  intended  to  be  used  for  reference  purposes. 
Neither  the  United  States  Government  nor  ITT  Research 
Institute  warrant  the  accuracy  of  this  information  and  data. 
The  user  is  further  cautioned  that  the  data  contained  herein 
may  not  be  used  in  lieu  of  other  contractually  cited 
references  and  specifications.  * 

Publication  of  this  information  is  not  an  expression  of  the 
opinion  of  TheUnited  States  Government  or  of  IIT  Research 
Institute  as  to  the  quality  or  durability  of  any  product 
mentioned  herein  and  any  use  for  advertising  or  promotional 
purposes  of  this  information  in  conjunction  with  the  name  of 
The  United  States  Government  or  HT  Research  Institute 
without  written  permission  is  expressly  prohibited. 
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RADC/ISXSI 

Griffisa  Air  Force  Base,  NY  13441 
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In  Dedication 


This  data  compendium  and  the  design  of  the  Software  Life-cycle  Empirical 
Database  (SLED)  which  it  describes  was  the  result  of  much  research  and  hard 
work  by  Christopher  S.  Turner,  a  Programmer  Analyst  at  the  DACS.  In  January 
of  1983  Chris  was  taken  from  us  as  the  result  of  a  tragic  automobile  accident 
His  loss  has  saddened  his  fellow  workers,  and  he  will  be  missed  for  a  long 
time  to  come. 

The  amount  of  effort  he  put  forth  in  his  unselfish  contributions  toward 
designing  the  SLED  and  completing  the  compendium  has  resulted  in  a  valuable 
research  tool  being  made  available  to  the  software  community.  We  are  pleased 
to  publish  this  compendium  in  dedication  to  Chris's  memory. 
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SECTION  I 


INTRODUCTION 

The  Data  &  Analysis  Center  for  Software  (DACS)  Is  a  DoD  Information 
Analysis  Center  (IAC)  which  was  established  in  response  to  a  well 
recognized  need  for  a  facility  to  serve  as  a  centralized  source  for 
current,  readily  usable  data  and  Information  concerning  software 
technology.  One  of  the  primary  functions  of  the  DACS  is  the  acquisition, 
maintenance,  and  dissemination  of  empirical  software  life  cycle  data  for 
research  purposes.  To  this  end,  this  document  describes  the  data  which  has 
been  acquired  and  which  is  being  maintained  at  the  DACS  and  delineates  the 
procedures  required  for  users  to  obtain  subsets  of  this  data. 

BACKGROUND 

The  DACS  has  acquired  seven  sets  of  data  from  various  sources  and 
maintains  this  data  in  the  Software  Life  Cycle  Empirical  Database  (SLED). 
In  that  each  set  of  data  was  the  result  of  a  data  collection  effort  which 
pursued  individually  specific  objectives,  the  resulting  datasets  differ 
with  regard  to: 

•  The  time  period  represented  by  projects  in  a  dataset 

•  The  portion  of  the  software  life  cycle  represented  by  the  data 

•  The  aspects  of  the  software  development  and/or  maintenance 
processes  measured  by  the  data  collection  activity 

•  The  quality  of  the  data  as  reflected  in  the  verification  and 
validation  procedures  used  in  data  collection 


•  The  subsequent  analyses  supported  by  the  data. 


For  reasons  of  clarity,  this  discussion  of  the  contents  of  the  SLED  will  be 
organized  by  contributing  organizations.  The  titles  of  some  of  the  datasets 
reflect  their  contents  as  opposed  to  the  contributing  organization.  The 
seven  sets  of  data  and  the  sources  associated  with  them  are  provided  below. 


(1)  The  DACS  Productivity  Dataset  -  Data  collected  from  various 
government  and  private  industry  sources  and  compiled  by  Richard 
Nelson  of  RADC. 

(2)  The  Reliability  Dataset  -  Data  collected  at  Bell  Laboratories, 
Whippany,  N.J.  and  compiled  by  John  Musa. 

(3)  The  NASA/SEL  Life  Cycle  Dataset  -  Data  collected  and  contributed 
by  the  Software  Engineering  Laboratory  (SEL)  at  ASA  Goddard 
Space  Flight  Center. 

(4)  The  Verification  A  Validation  (VAV)  Dataset  -  Do  .  collected 
under  several  Independent  VAV  contracts  then  <  larized  and 
delivered  to  the  DACS  by  Logicon  Incorporated. 

(5)  The  ARF  Error  Dataset  -  Data  collected  and  analyzed  on  the 

cleveTopment  of  the  Architecture  Research  Facility  (ARF)  at  the 
Naval  Research  Laboratories  (NRL)  by.  David  Weiss. 

(6)  The  Baseline  Software  Dataset  (BSDS)  -  Data  collected  on  six 

■defense  software  projects  from  various  organizations. 

(7)  The  Operations  and  Maintenance  0AM  Dataset  -  Data  collected  on 

the  operations  and  maintenance  of- the  PAVE  PAWS  radar  system. 


The  datasets  were  generated  at  different  points  in  time  and  it  is 
important  to  consider  this  when  analyzing  data  of  this  nature.  Figure  1-1 
illustrates  the  periods  of  time  represented  by  data  in  each  of  the 
datasets.  Each  of  the  datasets  contains  data  from  various  software  life 
cycle  phases  as  depicted  in  Figure  1-2.  Of  these  seven  datasets,  four  are 
available  in  a  standard  format.  The  remaining  three,  the  NASA/SEL,  the  BSDS 
and  the  0AM  datasets,  because  of  their  extensive  nature,  have  not  been 
processed  into  a  form  which  is  readily  usable  and  as  such  distribution  is 
limited  to  customized  versions  of  these  datasets,  prepared  in  response  to 
individual  requests.  Each  of  the  seven  sets  of  data  is  discussed 


FIGURE  1-1  TIME  PERIODS  REPRESENTED  BY  SLED  DATASETS 
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SECTION  II 


THE  DACS  PRODUCTIVITY  DATASET 


This  dataset  consists  of  summary  data  on  roughly  400  software  projects 
and  was  compiled  by  Richard  Nelson  of  RADC,  [NELSON-78]  whose  intent  was  to 
collect  data  supporting  analysis  of  the  effect  of  Implementation 
methodologies  on  productivity  during  software  development.  The  data  was 
obtained  from  open  literature  and  private  sources  In  Industry  and 
government  and  represents  software  development  projects  dating  from  the 
early  1960's  through  the  mid  1970' s.  The  software  applications  range  from 
avionics  and  space-flight  command  and  control  functions  and  radar  system 
support,  to  off-the-shelf  software  packages,  communications  software,  and 
management  information  systems.  Most  of  the  projects  represent  DoD  or  other 
government  applications. 


Essentially  eight  parameters  are  used  to  describe  each  software 
project  in  the  dataset.  These  parameters  and  their  associated  definitions* 
are  provided  below: 


(1)  Project  Identification  -  An  internally  assigned  identification 
number. 

(2)  Project  Size  -  Number  of  lines  of  source  code  (DSLOC)  in  the 
delivered  product.  Source  lines  are  80  character  source  records 
provided  as  input  to  a  language  processor.  Job  control  language, 
link  edit  language,  data  declarations,  internal  program  data  and 
comment  lines  are  included  in  the  count.  Unmodified  reused  code, 
test  drivers,  "throwaway"  code,  and  external  data  are  not 
included.  Where  the  number  of  lines  of  source  code  are  known,  but 


*The  definitions  for  these  parameters  are  reproduced  from  "Software  Data 
Collection  and  Analysis"  Draft  Report  by  Richard  Nelson,  RADC,  1978 
[NELSON-78]. 


the  content  of  the  source  code  Is  not,  the  assumption  has  been 
made  that  the  code  meets  the  above  definitions.  Where  the  size  of 
the  code  has  been  given  In  computer  words,  an  arbitrary 
conversion  has  been  made  to  DSLOC  by: 

DSLOC  3  size  (computer  words)/2  (for  high  order  language) 

DSLOC  3  size  (computer  words)  (for  assembly  language) 

(3)  Project  Effort  -  Total  effort  in  man-months  (TMM)  required  to 
produce  TPie  software  product.  Includes  effort  In  management, 
administration,  analysis,  operational  support,  and  other  areas 
such  as  documentation,  program  design,  coding  and  test,  l.e.  - 
all  effort  chargeable  to  the  project  by  the  builder. 

(4)  Project  Duration  -  Duration  of  project  In  total  months  (TM) 
derived  from  begin  and  end  dates  of  project,  less  any  "dead  time" 
In  the  project,  e.g.,  work  stoppage. 

(5)  Source  Code  Languages  -  Programming  languages  used  on  the  project 
are  recorded  by  name,  and  are  expressed  as  a  percentage  of  the 
total  DSLOC  written  in  each  different  language,  e.g.,  COBOL  80%, 
ASSEMBLY  15%,  JCL  5%.  Where  a  program  design  language  (PDL)  has 
been  used  to  define  the  program,  the  total  number  of  PDL  lines 
written  is  Included  as  a  separate  quantity  and  Is  not  Included  in 
the  DSLOC  count. 

(6)  Errors  -  Errors  (ERRS)  are  totalled  by  counting  the  number  of 
formally  recorded  Software  Problem  Reports  (SPR)  for  which  a  fix 
has  been  generated  during  the  period  covered  by  the  projecfT 
Redundant  SPR's  are  assumed  to  have  been  eliminated,  i.e., 
multiple  SPR's  reporting  the  same  problem.  (In  general,  equating 
errors  to  SPR's  on  a  one-to-one  basis  produces  a  low  count  for 
all  errors  encountered  during  the  software  development  process 
since  SPR's  are  usually  not  generated  until  such  time  as  the 
software  comes  under  formal  configuration  control.  This  point  Is 
usually  marked  by  the  end  of  software  unit  testing  and  the 
beginning  of  Integration  testing,  hence  many  of  the  errors 
detected  during  coding  and  subsequent  compilation  have  already 
been  removed.  Collecting  data  on  errors  on  the  coding  portion  of 
the  development  process  is  impractical  however,  and  error  data 
derived  from  SPR  data  can  be  extrapolated  to  the  entire 
development  process.) 

(7)  Documentation  -  Delivered  pages  of  documentation  (DOC)  includes 
program  listings,  flow  charts  (low  and  hi -level),  operating 
procedures,  maintenance  procedures  and  any  other  descriptive 
material  covering  the  design,  development,  test,  operation, 
Installation  and  maintenance  of  the  software. 

(8)  Implementation  -  The  implementation  techniques  used  on  the 

software  project  are  recorded  and  expressed  as  a  percentage  of 
the  DSLOC  built  using  the  specific  techniques  of  Structured 
Coding,  (SC),  Top-Down  Design  and  Programming  (TD),  Chief 
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Programmer  Team  (CPT),  Code  Reviews  -or  Inspections  (CR),  and 
Librarian  or  Program  Support  Library  (LIB).  For  definitions  of 
the  above,  refer  to  RADC  TR  74-300,  Vols  1-16,  "Structured 
Programming  Series." 


Additionally,  several  other  data-1 terns,  derived  from  those  mentioned 
previously,  are  recorded  for  projects  in  the  dataset.  These  are: 
Productivity  *  DSLOC/TMM 


Average  Number  of  Personnel  a  TMM/TM 


Error  Rate  *  ERRS/DSLOC 


Error  Rate  (temporal)  *  ERRS/TMM 
Documentation  Rate  >  D0C/0SL0C 


Not  every  parameter  has  been  recorded  for  every  project  In  the 
dataset.  Figure  2-1  displays  the  number  of  project  records  where  each  of 
six  primary  data-ltems  are  recorded.  This  figure  also  displays  the  number 
of  projects  represented  by  combi  nations  of  two  of  these  parameters.  The 
record  formats  for  this  dataset  are  Included  In  Appendix  A. 
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PARAMETERS  AND  COMBINATIONS  OF  PARAMETERS 


THE  SOFTWARE  RELIABILITY  DATASET 


This  dataset  consists  of  software  failure  data  on  16  projects  and  was 
compiled  by  John  Musa  of  Bell  Telephone  Laboratories  whose  objective  was  to 
collect  failure  Interval  data  to  assist  software  managers  In  monitoring 
status  and  predicting  schedules.  Careful  controls  were  employed  during  data 
collection  to  ensure  that  the  data  would  be  of  high  quality.  The  data  was 
collected  throughout  the  mid  1970's  and  represents  projects  of  a  variety  of 
applications  Including  real  time  command  and  control ,  word  processing, 
commercial  and  military. 


For  each  software  failure  In  the  dataset  the  following  Items  are 
recorded: 


(1)  Project  Identification  -  An  Internally  assigned  Identification 
number. 

(2)  Failure  Number  -  A  number  identifying  a  particular  failure. 
Failures  are  consecutively  numbered  from  the  first  failure 
reported. 

(3)  Failure  Interval  -  The  time  elapase  from  the  previous  failure  to 
^  current  Failure  In  seconds.  For  one  of  the  projects  In  the 
dataset  this  time  is  given  In  CPU  seconds,  for  the  remaining 
projects  this  time  Is  given  In  wall -clock  seconds. 

(4)  Day  of  Failure  -  Represents  the  day  on  which  the  failure  occurred 
1 n  terms  of  the  number  of  working  days  from  the  start  of  the 
current  phase  or  data  collection  period. 


Figure  2-2  displays  the  size  of  each  sample  of  failure  data  for  each 
project  In  the  dataset,  as  well  as  other  useful  Information  concerning  the 
Individual  projects.  More  detailed  information  on  the  specific 
characteristics  of  each  project  Is  available  in  a  report  entitled  "Software 
Reliability  Data"  by  John  Musa,  Bell  Telephone  Laboratories  [MUSA-79].  The 
record  formats  for  this  dataset  are  provided  In  Appendix  B. 
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NOIL:  Starred  phases  indicate  the  failure  sample  is  not  complete  for  that  phase. 


THE  NASA/SEL  SOFTWARE  LIFE  CYCLE  DATASET 


The  Software  Engineering  Laboratory  (SEL)  was  established  by  NASA's 
Goddard  Space  Flight  Center  (GSFC)  In  1977  to  investigate  the  effectiveness 
of  software  engineering  techniques  as  applied  to  the  development  of 
ground- support  space  flight  dynamics  systems.  The  overall  goals 
[NASA/ SEL -82A]  of  the  program  are  to: 

(1)  Understand  the  software  development  process  In  the  GSFC 
environment. 

(2)  Measure  the  effects  of  various  development  models,  tools,  and 
methodologies  on  the  development  process. 

(3)  Identify  and  apply  Improved  methodologies  In  the  GSFC 
envi ronment. 

To  accomplish  these  goals,  the  SEL  collects  extensive  data  on  software 
developed  by  the  Systems  Development  Section  at  NASA/GSFC,  which  Is 
responsible  for  generating  flight  dynamics  support  software  for 
GSFC-supported  missions.  This  data  Is  forwarded  to  the  DACS  on  a  roughly 
annual  basis. 

The  projects  represented  by  the  data  span  the  functions  of  attitude 
determination,  attitude  control,  maneuver  planning,  orbit  adjustment  and 
general  mission  analysis  support  systems.  The  data  collected  encompasses 
software  development  projects  that  started  as  early  as  1976  and  projects 
currently  under  development.  Figure  2-3  illustrates  the  history  of  projects 
covered  In  the  dataset. 

The  data  contained  In  the  NASA/SEL  Software  Life  Cycle  Dataset  is 
gathered  through  five  sources: 
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•  Software  Engineering  Forms 

•  Computer  Accounting 

•  Personal  Interviews 

•  Automated  Tools 

•  Management  Summaries 


The  files  in  which  some  of  the  data  is  stored  are  structured  according 
to  the  forms  used  to  collect  the  data.  These  forms  are  summarized  in  Figure 
2-4,  reproduced  from  [NASA/SEL-81A].  The  data  collected  through  these 
forms  and  other  media  is  maintained  in  11  files,  distributed  by  the  DACS. 
These  files  and  their  contents  are  described  below: 


Encoding  Dictionary  File  -  One  record  is  provided  for  each  code  used 
in  the  other  files  to  represent  lengthy  alpha-numeric  or  text 
Information.  The  information  provided  in  this  file,  is  included 
in  record  format  descriptions  of  the  other  files. 

Estimated  Statistics  File  -  One  record  is  provided  for  each  project. 
Summarizes  size,  effort,  and  some  computer  environment  data.  Data 
is  actual,  not  estimated  as  implied  by  file  name. 

Header  File  -  One  record  is  provided  for  each  project.  Provides 
schedule  dates  for  the  requirements  definition,  design,  code, 
system  test,  acceptance  test  and  cleanup  phases.  Maintenance 
start  and  end  dates  are  also  provided. 

Change  Report  File  -  One  record  is  provided  for  each  error  or  change 
occurring  in  a  project.  A  record  summarizes  the  information 
recorded  on  a  Change  Report  form;  Including  dates,  effort 

required  for  change,  type  of  change,  and  information  classifying 
the  error  if  the  change  was  a  result  of  an  error. 

Component  Status  Report  File  -  One  record  is  provided  for  each  line  on 
each  Component  Status  Report  form  (completed  weekly  during 

development).  Several  records  provide  the  hours  spent  on  design, 
code,  and  test  activities  for  each  component  of  a  given  project 
on  a  weekly  basis. 

Component  Summary  File  -  Two  records  are  provided  for  each  component 
in  each  project.  Records  summarize  a  general  description  of  each 
component  including  complexity,  application,  size,  schedule, 
effort  to  develop,  and  language.  Each  record  is  either  an 
estimate  of  these  parameters  provided  when  the  component  is  first 
defined  or  the  actual  values  of  the  parameters  upon  completion  of 
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Form 

Description  of  Content 

General  Project 
Summary 

Computer  resources  used,  starting  and 
ending  dates  of  each  phase,  cost 
information,  size  of  product, 
methodologies  and  tools  used  in  each 
phase  of  development,  personnel 
involved,  standards  used, 
documentation  produced,  problems 
anticipated,  and  quality  assurance 
information 

Change  Report 

Change  description,  components 
changed,  effort  to  change,  type  of 
change  or  error,  and  activities  used 
to  validate  changes,  to  detect  errors, 
and  to  find  their  cause 

Resource  Summary 

Number  of  hours  of  worktime  per  week 
per  staff  member  spent  on  a  particular 
project,  computer  usage,  and  other 
charges 

Component  Status 

Time  spent  during  the  week  in  a 
certain  activity  of  component 
development  (e.g.,  design,  testing,  or 
documentation) 

Component  Summary 

Interfaces,  programming  language,  com¬ 
plexity,  resources  required  for  each 
phase  of  development,  relation  to 
other  components,  and  code 
spec  if ications 

Run  Analysis 

Computer  used,  purpose  of  the  run, 
type  of  run,  run  results,  and  comments 

Maintenance  Report 

Subset  of  change  report  with  some 
maintenance-specific  questions 

the  project. 
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Resource  Summary  File  -  One  record  is  provided  for  each  line  on  the 
Resource  Summary  form  (each  line  on  the  form  being  completed 
weekly  by  project  management,  up  to  11  weeks  per  form).  Records 
summarize  the  consumption  of  manpower,  computer,  support  services 
or  other  resources  for  a  specified  time  period. 

Run  Analysis  File  -  One  record  is  provided  for  each  line  on  the  Run 
Analysis  form  (up  to  nine  computer  runs  per  form).  Records 
summarize  the  objectives  and  results  of  each  computer  job 
submitted,  and  whether  the  run  was  interactive  or  batch.  Date  of 
run  is  also  provided. 

Component  Information  File  -  One  record  is  provided  for  each  component 
in  each  project.  Records  provide  information  obtained  from  Source 
Analyzer  Programs,  including  a  number  of  Halstead  complexity 
metrics,  and  instruction  mix  parameters. 

Growth  History  File  -  One  record  is  provided  for  each  week  during  the 
life  cycle  of  several  NASA  projects.  Records  provide  the 
cumulative  number  of  source  lines  written,  modules  and  changes 
for  each  week,  and  are  generated  through  an  automated  collection 
tool . 

More  comprehensive  lists  of  the  parameters  contained  in  each  file  are 
provided  in  Appendix  C.  More  detailed  information  regarding  the 
organization  and  contents  of  the  NASA/SEL  Dataset  may  be  found  in 
[ NASA/SEL -81B]  and  [NASA/SEL -81C] . 


I  Not  every  project  recorded  in  the  database  contains  complete  data. 

Figure  2-5  displays  the  number  of  records  in  each  file  recorded  for  each 
project. 
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Project 

Code 


1 

2 

3 

5 

6 
7 
3 
9 

10 

13 

15 

16 

19 

20 
21 
26 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 
57 
53 

59 

60 
61 
62 
53 


NASA-SEL  Files 


EST 

HF 

CIF 

CSF 

CSR 

CRF 

RAF 

RSF 

HIS 

CMT 

1 

1 

191 

121 

392 

0 

216 

0 

0 

146 

1 

1 

415 

225 

1522 

290 

1164 

92 

42 

618 

1 

1 

49 

0 

138 

0 

0 

0 

0 

0 

1 

1 

683 

126 

1027 

311 

2018 

99 

36 

1064 

1 

1 

736 

175 

1788 

491 

1877 

121 

53 

1119 

1 

1 

53 

55 

153 

55 

186 

11 

0 

217 

1 

1 

539 

316 

810 

240 

984 

60 

25 

823 

1 

1 

48 

22 

422 

0 

403 

0 

0 

165 

1 

1 

944 

295 

1025 

46 

1312 

91 

34 

423 

1 

1 

53 

0 

79 

0 

0 

0 

0 

0 

1 

1 

86 

0 

155 

0 

58 

0 

0 

36 

1 

1 

12 

1 

46 

1 

0 

0 

0 

0 

1 

1 

815 

865 

2175 

686 

3168 

162 

53 

3073 

1 

1 

28 

0 

224 

0 

74 

20 

0 

0 

1 

1 

465 

0 

0 

182 

45 

253 

0 

393 

1 

1 

902 

564 

2407 

585 

2330 

147 

58 

1574 

1 

1 

46 

0 

392 

0 

2 

20 

0 

0 

1 

1 

111 

0 

425 

103 

77 

63 

0 

213 

1 

1 

530 

82 

5191 

930 

4860 

211 

63 

5218 

1 

1 

518 

108 

5160 

751 

7101 

216 

62 

5671 

1 

1 

138 

179 

722 

0 

362 

93 

54 

290 

1 

1 

83 

71 

502 

15 

111 

110 

0 

128 

1 

1 

219 

67 

1331 

132 

1017 

145 

52 

1380 

1 

1 

233 

161 

682 

85 

1 

22 

0 

326 

1 

1 

278 

3 

321 

0 

78 

79 

0 

34 

1 

1 

140 

0 

601 

5 

793 

31 

0 

422 

1 

1 

67 

0 

0 

0 

0 

15 

0 

0 

1 

1 

477 

1 

2011 

5 

0 

165 

46 

14 

1 

1 

129 

1 

2657 

67 

0 

124 

32 

0 

1 

1 

211 

0 

538 

125 

0 

43 

0 

91 

1 

1 

81 

0 

183 

8 

0 

36 

44 

25 

1 

1 

65 

0 

442 

24 

0 

30 

43 

58 

1 

1 

826 

0 

2145 

41 

5 

113 

44 

3 

1 

1 

349 

0 

649 

6 

0 

88 

39 

23 

1 

1 

1 

0 

113 

0 

0 

12 

0 

0 

File  Codes: 

EST  -  Estimated  Statistics  File 
HF  -  Header  File  (Scheduling  Data) 
CIF  -  Component  Information  File 
CSF  -  Component  Summary  File 
CSR  -  Component  Status  Report  File 
CRF  -  Change  Report  File 
RAF  -  Run  Analysis  File 
RSF  -  Resource  Summary  File 
HIS  -  Growth  History  File 
CMT  -  Comment  File 


FIGURE  2-5  CONTENTS  OF  THE  NASA/SEL  DATASET  BY  PROJECT 
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THE  VERIFICATION  AND  VALIDATION  (V&V)  DATASET 


This  dataset  contains  data  collected  during  the  independent 
Verification  and  Validation  ( V&V )  of  five  software  projects.  The  purpose  of 
this  data  collection  effort  was  to  record  the  types  of  errors  which  can  be 
detected  during  independent  V&V  activities.  Data  on  three  of  these  projects 
has  been  recorded  at  the  subsystem  level  as  these  subsystems  underwent 
separate  V&V. 


The  data  in  this  dataset  is  maintained  in  two  files  described  below: 


•  Summary  Data  File  -  This  file  contains  summary  level  information 
describing  each  project  or  subsystem  compiled  from  various  sources. 
Information  includes  size,  schedule  and  development  methodology 
information. 

•  Anomaly  Report  Data  -  This  file  contains  data  describing  each 

anomaly  or  error  uncovered  by  the  independent  V&V  team.  Anomalies 
included  occurrences  of  non-conformity  to  pre-established 

standards.  Each  anomaly  report  contains  information  specifying  the 
anomoly  location,  category,  and  severity  as  well  as  tools  used  to 
detect  and  correct  the  anomaly. 


The  actual  data  parameters  recorded  in  each  of  the  two  files  are  provided 
in  the  record  formats  illustrated  in  Appendix  D. 


The  data  contained  in  this  dataset  is  complete  and  of  very  good 
quality  due  to  stringent  control  placed  on  the  data  collection  effort. 
Figure  2-6  summarizes  the  data  recorded  for  each  project  or  subsystem. 
Further  background  information  may  be  obtained  from  [RADA-81]. 
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System/ Subsystem 
Code 


System/Subsystem 

Size 


Schedule 
Start  End 


Number  of 
Anomaly  Reports 


1  A 

24,000 

9/74 

4/76 

37 

B 

24,000 

2/73 

11/75 

28 

C 

41,000 

10/73 

1/76 

142 

2  A 

24,000 

1/76 

6/78 

48 

B 

24,000 

7/75 

6/78 

46 

C 

41,000 

3/76 

3/78 

141 

3  A 

14,000 

1/75 

9/76 

54 

B 

39,000 

1/75 

7/78 

117 

4 

9/77 

12/79 

84 

5 

1/74 

1/81 

235 

TOTAL  207 


TOTAL  235 


TOTAL  171 


FIGURE  2-6  CONTENTS  OF  THE  V&V  DATASET 


THE  ARCHITECTURE  RESEARCH  FACILITY  (ARF)  ERROR  DATASET 


The  Architecture  Research  Facility  (ARF)  was  developed  at  the  Naval 
Research  Laboratories  (NRL)  to  aid  the  rapid  simulation  of  different 
computer  architectures  for  research  and  evaluation  purposes.  Data  was 
collected  during  the  development  of  this  project  to  evaluate  development 
methodologies  used  on  the  project.  This  data  was  compiled  by  David  Weiss  of 
the  NRL.  Dr.  Weiss  was  not  involved  in  the  development  of  the  ARF  but 
functioned  as  an  independent  collector  and  validator  of  the  data.  This  data 
was  provided  to  the  DACS  in  hardcopy  form  in  March  1982,  to  be  processed 
and  entered  online. 

The  dataset  consists  of  117  error  reports  dealing  with  143  errors 
isolated  and  corrected  during  ARF  development.  Also  included  in  this 
dataset  are  253  records  describing  each  routine  of  which  the  project  is 
comprised  and  several  project  descriptive  records.  The  contents  of  each 
record  type  are  described  below. 


•  Error  Report  Data  -  one  record  is  provided  for  each  error  report. 
Contains  the  dates  the  error  was  observed  and  corrected,  the  type 
of  error,  the  routine  in  which  the  error  was  isolated,  the  effort 
required  to  isolate  and  correct  the  error,  and  the  activities  and 
tools  used  to  isolate  the  error  and  validate  the  routine. 

•  Module  Descriptive  Data  -  one  record  is  available  for  each  routine 
in  the  system.  Contains  the  site  of  the  routine,  number  of 
pre-processor  statements,  number  of  comments,  subjective  complexity 
of  the  routine,  and  the  function  of  the  module  or  routine. 

•  System  Descriptive  Data  -  several  re.ords  are  provided  for  the 
project.  Contain  or  describe  the  size  and  other  attributes  of  the 
project,  a  description  of  the  development  environment,  and  a 
description  of  the  development  methodologies  used  on  the  project. 


More  detailed  listings  of  the  da ta-i terns  contained  in  the  ARF  dataset 
are  provided  in  Appendix  E.  The  data  is  of  generally  good  quality  and  there 
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are  very  few  missing  data-ltems.  Figure  2-7  Illustrates  the  percentage  of 
major  parameter  occurrences  In  the  dataset.  Information  not  available  in 
System  Descriptive  Data  may  be  obtained  from  two  sources  [WEISS-78]  and 
[ELOV-79] . 


Parameter 

Number  of  Records 
Containing  Parameter 

Percent 

Type  of  Change 

117 

100% 

Effort  of  Change 

116 

>99% 

Date  Change  was  Determined 

116 

>99% 

Type  of  Error 

116 

>99% 

Time  to  Isolate  Error 

115 

>99% 

Activities  Used  to  Isolate  Error 

114 

>99% 

Programmer  Code 

116 

>99% 

Total  Number  of  Records 


117 


THE  OPERATIONS  &  MAINTENANCE  (04M)  DATASET 


The  PAVE  Phased  Array  Warning  Systems  (PAWS)  is  an  over-the-horizon  j 

radar  system  in  operation  at  Otis  Air  Force  Base,  Massachusetts  and  Beale 
Air  Force  Base,  California.  The  software  for  this  system  is  essentially  the 
first  large  DoD  software  project  developed  and  maintained  in  a  j 

comprehensive  environment  of  software  engineering  tools  and  techniques. 

Parallel  to  the  development  of  this  system,  an  extensive  data  collection 

i 

process  was  Initiated,  employing  the  facilities  of  the  Program  Support  | 

Library  (PSL).  The  PSL  is  a  portion  of  the  PAVE  PAWS  software  developed  to 
manage  the  configuration  of  the  remainder  of  the  project.  In  addition  to 

t 

this  the  PSL  collects  and  maintains  size  and  change  data  on  a  continual  I 

basis.  As  such,  a  vast  amount  of  very  detailed  data  tracking  maintenance 

( 

activities  Is  continually  being  collected  through  the  operational  life  of 

t 

the  project.  Full  details  of  the  data  collection  effort  and  subsequent  ' 

analysis  are  available  in  [IITR-82]. 


The  data  collected  against  the  PAVE  PAWS  project  is  maintained  at  the 
DACS  in  three  primary  forms: 


•  Online  Data  Files  -  consists  of  seven  online  datafiles,  summarizing 
information  collected  from  other  sources.  These  are  discussed 
individually  below. 

t  Offline  Data  Files  -  consists  of  machine  readable  data  files  and 
reports  produced  by  the  PSL.  These  are  stored  on  magnetic  tape  and 
updated  monthly. 

•  Hardcopy  Data  -  consists  of  analyses  of  unresolved  discrepancies, 
minutes  from  configuration  review  board  meetings,  and  project 
specification  and  source  code  data. 


The  most  accessible  and  easily  processed  data  is  included  in  the 
online  data  files.  These  seven  files  and  their  respective  contents  are 


provided  below: 


(1)  Maintenance  Activity  File  -  This  file  records  maintenance 
activities  performed  on  the  project.  Contains  for  each  approved 
change,  the  type  of  maintenance  activity  being  performed,  the 
precision  of  specifications  for  that  activity,  the  complexity  of 
the  maintenance  effort,  and  data  concerning  how  the  error  was 
detected  and  diagnosed. 

(2)  CPCG  Description  File  -  Contains  data  providing  information 
concerning  the  characteristics  of  the  PAVE  PAWS  software  at  the 
CPCG  level,  Including  size  of  the  CPCG  In  source  lines  and  words 
of  machine  code,  number  of  CPCI's,  environmental  factors  and 
development  constraints.  (CPCG  is  an  acronym  for  Computer  Program 
Configuration  Group,  CPCI  is  an  acronym  for  Computer  Program 
Configuration  Item.) 

(3)  CPCG  Status  File  -  Contains  information  concerning  the  status  of 
CPCG' s  3ur1ng  the  maintenance  cycle.  The  size  of  the  CPCG,  Its 
version  identification,  and  the  date  the  last  change  was  made  to 
the  CPCG. 

(4)  Segment  Change  History  File  -  Contains  a  history  of  changes  or 
additions  to  the  project  at  the  segment  level.  The  size  of  the 
segment,  and  the  number  of  changes  made  to  the  segment  Is 
recorded  each  time  a  change  occurred. 

(5)  Change  History  File  -  Contains  a  history  of  changes  or  additions 

tfiie  project  at  the  program  level.  The  size  of  the  program  (In 
number  of  segments)  is  maintained  at  each  change. 

(6)  Discrepancy  Report  History  File  -  Contains  information  recorded 
on  each  Hi  screpancy  Report  including  the  origin  and  dates 
associated  with  actions  taken. 

(7)  Personnel  Experience  Profile  -  Provides  background  information  on 
personnel  assigned  to  the  maintenance  of  PAVE  PAWS  software, 
including  education,  work  experience,  programming  language 
experience,  and  experience  on  related  projects. 


Detailed  contents  of  each  of  these  files  are  available  in  Appendix  F. 
Figure  2-8  provides  the  number  of  records  in  each  of  these  files. 


The  majority  of  data  collected  by  the  PSL  is  stored  in  essentially  one 
file,  the  Configuration  Management  Data  Base.  This  file  consists  of  a 
number  of  Discrepancy  Report  records,  describing  the  status  of  changes  made 
to  the  system.  Each  record  contains  the  following  types  of  information: 
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File 


Number  of  Records 


Maintenance  Activity  File  984 
CPCG  Description  File  64 
CPCG  Status  File  64 
Segment  Change  History  File  7 
Change  History  File  7 
Discrepency  Report  History  File  18 


Programmer  Experience  Profile  File 


81 


•  A  description  of  the  discrepancy  report; 

•  The  date  the  discrepancy  report  was  opened. 

•  The  priority  of  action  to  be  taken. 

•  Oates  associated  with  various  stages  of  maintenance. 

•  Effort  expended  on  the  maintenance  activity. 

More  detailed  discussions  of  the  functions  of  the  PSL  are  provided  In 
[LUPINO-74]  and  [TINANOFF-74]. 
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THE  BASELINE  SOFTWARE  DATA  SYSTEM  (BSDS)  DATASET 


This  DACS  dataset  consists  of  data  describing  six  large  software 
development  efforts.  These  projects  and  the  data  are  described  in  [RYE-77], 
[BAKER- 77],  [WILLIAM-77],  [FRIES-77],  [THAY-70],  and  [HECT-77].  The  system 
application  areas  encompass  command  and  control,  real-time  control  for 
land-based  radar,  onboard  guidance  and  navigation,  and  database  management. 

The  majority  of  data  contained  in  this  dataset  is  derived  from 
Software  Problem  Reports  collected  against  the  six  projects.  These  problem 
reports  are  described  in  terms  of: 

•  The  dates  that  the  problem  report  was  opened  and  closed. 

•  The  module  manifesting  the  problem. 

•  The  module  changed  to  correct  the  problem. 

•  The  problem  category. 

•  The  problem  priority. 

•  The  corrective  action. 

Three  of  the  projects  also  contain  data  describing  modules.  This  data 
includes  the  size  of  the  module  in  source  instructions,  the  source 
programming  language  used,  the  type  of  construction  for  the  module,  and  a 
designation  of  the  functional  application  of  the  module.  Additionally,  one 
project  contains  data  describing  each  test  run  during  development.  Figure 
2-9  summarizes  the  number  of  records  and  types  of  records  provided  for  each 
project  in  the  dataset.  Detailed  record  formats  are  not  available  for  this 
dataset,  however,  Appendix  G  provides  a  comprehensive  list  of  parameters 
recorded. 
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jProject 

Number 


Number  of 
Records 


Number  of 
Data  Items 


Number  of 
Record  Types 


1 

4,970 

28 

1 

2 

2,113 

46 

5 

3 

2,274 

35 

2 

4 

11,730 

17 

1 

1  5 

8,106 

18 

2 

6 

2,719 

15 

1 

i 

FIGURE  2-9  PROJECT  IN  THE  BSDS  DATASET 


Reproduced  from  [DUVALL-7911 


THE  DACS  COMPOSITE  PRODUCTIVITY  DATASET 


The  SLED  Is  currently  undergoing  enhancement  through  a  restructuring  of 
copies  of  portions  of  its  contents.  This  will  result  in  a  database  that  may 
be  searched  for  projects  of  a  specific  language,  application  or  other 
qualifying  characteristic.  As  part  of  the  investigation  into  the 
feasibility  of  performing  this  effort  a  dataset  was  constructed  containing 
data  extracted  from  three  of  the  datasets  discussed  in  the  previous 
section.  The  DACS  Composite  Productivity  Dataset  contains  data  from  the 
DACS  Productivity  Dataset,  the  NASA/SEL  Dataset,  and  the  V&V  Dataset. 
Essentially,  the  same  parameters  provided  in  the  RADC  dataset  are  provided 
here,  with  the  exception  of  source  language  composition.  The  DACS  Composite 
Productivity  Dataset  provides  only  the  primary  source  language  as  opposed 
to  the  three  primary  source  languages.  Additionally,  a  field  has  been  added 
to  Indicate  from  which  dataset  the  project  originated.  The  data  contained 
In  this  dataset  will  be  verified  and  additional  fields  and  records  will  be 
added  on  a  continuing  basis.  Figure  2-10  illustrates  the  composition  of  the 
dataset  and  detailed  record  formats  are  provided  in  Appendix  H. 
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ORDERING  INFORMATION 


The  DACS  is  a  Defense  Logistics  Agency  (DLA)  information  analysis 

center  (IAC).  As  a  DoD/DLA  IAC  the  DACS  is  required  to  institute  a  system  j 

of  user  charges  with  the  object  of  recovering  a  significant  portion  of  the 
cost  of  operating  the  DACS.  Charges  for  DACS  products  are  designed  to 
recover  a  portion  of  their  development  cost  and  the  cost  of  reproduction,  j 

handling  and  mailing. 

Orders  for  DACS  products  must  be  placed  directly  with  the  DACS.  Orders 
should  clearly  specify  the  dataset(s)  and  services  desired.  Except  for 
blanket  purchase  orders,  prepayment  is  required.  Please  make  checks  payable 
to:  I ITRI/DACS. 

At  this  time,  the  OACS  is  distributing  copies  of  the  Software 

Reliability  Dataset,  the  DACS  Productivity  Dataset,  the  V&V  Dataset,  the 

a 

ARF  Error  Dataset  and  the  OACS  Composite  Productivity  Dataset  in  * 

standard  formats.  The  price  lists  at  the  end  of  this  compendium  give 
prepaid  prices  for  these  standard  DACS  datasets.  Subsets  of  the  Baseline 
Software  Dataset,  the  NASA/SEL  Oataset,  and  the  O&M  Dataset  are  available  \ 

at  costs  dependent  upon  processing  time. 

Call  or  write  for  assistarce  in  determining  the  availability  of  data  I 

to  meet  your  research  needs  and  to  determine  its  cost. 
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the  following: 


Unsorted  (dataset  order) 
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(  )  Sorted  on  delivered  source  code  lines 
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APPENDIX  A 


i 


RECORD  FORMATS  and  DATA  DICTIONARY 
for  the 

DACS  PRODUCTIVITY  DATASET 


Record  Format 

DACS  Productivity  Dataset 

Record  Type  A 


Field 

Descripti  on 

Format 

Comments 

9-  12 

Project  Code 

1(4) 

13 

Record  Type 

A(  1) 

"A" 

14-  16 

Source  of  Data 

1(3) 

References  Nelsons 

18-  57 
59-  69 

Filler 

Primary  Language 

A(  11) 

Bib! iography 

71-  73 

Percent  of  Primary  Language 

1(3) 

Record  Format 

DACS  Productivity  Dataset 

Record  Type  B 

Fi  el  d 

Description 

Format  Comments 

9-  12 

Project  Code 

1(4) 

13 

Record  Type 

A ( 1 )  "B" 

14-  25 

Secondary  Language 

A(  12) 

27-  29 

Percent  of  Secondary  Language  Usage 

1(3) 

31-  42 

Tertiary  Language 

A(  12 ) 

44-  46 

Percent  of  Tertiary  Language  Usage 

1(3) 

48-  53 

Source  Code  Written  in  a  POL 

1(6) 

55-  59 

Ratio  of  PDL  Source  to  Total 

F  ( 5 .3 ) 

61-  66 

Pages  of  Documentation 

1(6) 

68-  72 

Ratio  Documentation  to  Total  Source 

F(5.3) 

Record  Format 

DACS  Productivity  Dataset 

Record  Type  C 

Fiel  d 

Descri pti on 

Format 

Comments 

9-  12 

Project  Code 

1(4) 

13 

Record  Type 

A(  1) 

11 C" 

14-  20 

Delivered  Source  Lines  of  Code 

1(7) 

22-  26 

Manmonths  of  Development  Effort 

1(5) 

Charged  to  Project 

28-  30 

Development  Schedule  Length 

1(3) 

To  nearest  month 

32-  36 

Productivity  (Lines  per  Manmonth) 

1(5) 

38-  40 

Structured  Code  Usage 

1(3) 

Percent  of  total 

source 

42-  44 

Top-Down  Programming  Usage 

1(3) 

Percent  of  total 

source 

46-  48 

Chief  Programmer  Team  Usage 

1(3) 

Percent  of  total 

source 

50-  52 

Programmer/Librarian  Usage 

1(3) 

Percent  of  total 

source 

54-  56 

Formal  Code  Review  Usage 

1(3) 

Percent  of  total 

source 

58-  62 

Number  of  Software  Problem  Reports 

1(5) 

64-  59 

Ratio  of  SPRs  to  Total  Source 

F(6.3) 

Record  Format 

Software  Reliability  Oataset 
Failure  Interval  Record 


Field 

Descri pti on 

Format 

Comments 

1-  4 

Project  Code 

A  ( 4 ) 

5-  9 

Failure  Number 

1(5) 

Sequentially  assigned 

10-  19 

Failure  Interval  Length 

1(9) 

In  seconds 
(see  note) 

20-  24 

Day  of  Failure 

1(5) 

From  start  of  project 
phase  or  data  collection 
period 

Mote 

Failure  intervals  for  project  6 

are  in  CPU  seconds; 

wall  clock  time  is  used 

for  all  other  projects 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Component  Information  File  (CIF) 


Field 

Descripti  on 

Format 

Comments 

1-  2 

Project  Code 

1(2) 

3-  10 
11-  13 

Fil ler 

Component  Code 

1(3) 

14-  15 

Panvalet  Level  Number 

1(2) 

16-  17 

Module  Function 

1(2) 

From  Encoding  Dictionary 

18-  19 

System/Subsystem  Function 

1(2) 

1  =  I/O  processing 

2  =  Algorithmic 

3  =  Logic  control 

20 

Origin 

Kl) 

4  =  System  related 

5  =  Data/COMMON  block 

6  =  Other 

Blank  =  no  response 

From  Encoding  Dictionary 

21-  24 

Number  of  Executable  Statements 

I  4) 

1  *  New  code 

2  »  Extensively  modified  code 

3  3  Slightly  modified  code 

4  =  Copy  of  existing  code 

25-  28 

Number  of  Lines  with  Comments 

1(4) 

29-  31 

Number  of  Comment  Lines 

1(4) 

32-  34 

Number  of  Unique  Operators 

1(3) 

35-  37 

Number  of  Unique  Operands 

1(3) 

38-  41 

Total  Number  of  Operators 

1(4) 

42-  45 

Total  Number  of  Operands 

1(4) 

46-  48 

Number  of  I/O  Variables  from  Module 

1(3) 

49-  51 

Number  of  Decisions 

1(3) 

McCabes  measure 

52-  54 

Number  of  FUNCTION  References 

1(3) 

55-  57 

Number  of  I/O  Statements 

1(3) 

58-  60 

Number  of  Assignment  Statements 

1(3) 

51-  63 

Number  of  CALL  Statements 

1(3) 

64-  66 

Number  of  FORMAT  Statements 

1(3) 

67 

Status  Flag 

1(1) 

1  =  Unchecked  from  GSFC 

2  =  Unchecked  from  UM 
5  =  Hand  checked 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Comment  File  (CMT) 


Field 

Description 

Format 

Comments 

1-  6 

Form  Number 

A  ( 6 ) 

Eg. 

D00633 

7-  8 

Sequence  Number 

1(2) 

9 

Comment  Type 

A(  1) 

C  3 

Comment 

D  3 

Description 

R  3 

Reason 

U  3 

Useful  item 

11-  12 

Project  Code 

1(2) 

13 

Comment  is  Continued 

A  ( 1 ) 

Y  3 

Yes 

N  3 

No 

14-103 

Text  of  Comment 

A(  90 ) 

104 

Status  Flag 

1(1) 

1  3 

Unchecked  from  GSFC 

2  =  Unchecked  from  UM 
5  =  Hand  checked 


C-2 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Change  Report  File  (CRF) 


Field  Description 

1-  6  Form  Number 
7-  8  Project  Code 
9-  13  Programmer  Code 
14-  19  Form  Date 

20-  21  Number  of  Components  Changed 

22-  23  Number  of  Components  Examined 

24  More  than  One  Component  Affected 

25-  30  Date  Change  was  Determined 
31-  36  Date  Change  was  Started 
37  Effort  for  Change 


38-  41  Type  of  Change 


42-  56  Codes  of  changed  Components 
57-  60  Type  of  Error 


61  Phase  when  Error  Entered  System 


62  Data  Structure  Error  Flag 

63  Control  Logic  Error  Flag 


Format  Comments 

A(6)  Eg.  D00633 
1(2) 

1(5) 

1(6)  YYMMDD 

1(2)  May  be  greater  than  5 

1(2) 

1(1)  To  be  added 
1(6)  YYMMDD 
1(6)  YYMMDD 

1(1)  From  Encoding  Dictionary 

1  *  Less  than  one  hour 

2  =  One  hour  to  a  day 

3  *  One  day  to  three  days 

4  =  Over  three  days 
Blank  ■  no  response 

4A( 1)  From  Encoding  Dictionary 
Up  to  4  responses 

1  =  Error  correction 

2  -  Planned  Enhancement 

3  =  Implement  Req.  change 

4  =  Improve  clarity 

5  =  Improve  user  service 

6  =  Develop  utility  only 

7  *  Optimization 

8  *  Adapt  to  envir.  change 

9  =  Other 

Blank  *  no  response 
51(3)  Up  to  five  component  codes 
41(1)  From  Encoding  Dictionary 
Up  to  four  responses 

1  =  Req.  incorrect 

2  =  Funct.  Specs,  incorrect 

3  =  Design  error  several  comps 

4  =  Design  error  one  comp. 

5  =  Misunderstand  ext.  envir. 

6  *  Error  in  language  usage 

7  =  Clerical  error 

8  =  Other 

Blank  =  no  response 
1(1)  From  Encoding  Dictionary 

1  =  Requirements  Def. 

2  =  Functional  Specification 

3  =  Design 

4  =  Code  and  Test 

5  =  System  Test 

6  =  Unknown 

Blank  =  no  response 
A(  1)  X  =  Yes 

Blank  =  No 
A  ( 1 )  X  =  Yes 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Change  Report  File  (CRF)  continued 

Field  Description  Format  Comments 


ERROR  ISOLATION  ACTIVITIES 
64-  68  For  Program  Validation 

69-  73  For  Detecting  Symptoms 

74-  78  Tried  in  Finding  Cause 

79-  83  For  Finding  Cause 

84  Time  to  Isolate  Error 

35  Work  Around  Used  Flag 
86  Related  to  Previous  Change  Flag 
87-  91  Previous  Form  Number 

92-  97  Previous  Form  Date 

98  Reason  Comment  Flag 

99  Descriptive  Comment  Flag 

100  General  Comment  Flag 

101  Status  Flag 


From  Encoding  Dictionary 
5A( 1)  Up  to  five  response  each 

1  =  Pre-acceptance  test 

2  =  Acceptance  test 

5A( 1)  3  =  Post-acceptance  test 

4  =  Inspection  of  output 

5  =  Code  reading  by  prgmr. 

5A( 1 )  6  =  Code  reading  by  another 

7  =  Talk  with  other  prgmrs. 

8  =  Special  debug  code 

5A( 1)  9  =  System  error  message 

A  =  Project  specif,  err.  msg. 

B  =  Reading  Documentation 

C  =  Trace 
D  »  Dump 

E  »  Cross  reference 
F  =  Proof  Technique 
G  =  Other 

Blank  *  no  response 
1(1)  From  Encoding  Dictionary 

1  =  Less  than  one  hour 

2  =  One  hour  to  a  day 

3  =  More  than  one  day 

4  *  Never  found 
Blank  =  no  response 

A(  1 )  Y  =  Yes 
N  =  No 

Blank  =  no  response 
A  ( 1 )  Y  =  Yes 
N  =  No 

Blank  =  no  response 
1(5)  Excludes  first  character, 
includes  leading  zeroes 
eg.  00633 
1(6)  YYMMDD 
A ( 1 )  Y  =  Yes 

N  =  No 
A( 1 )  Y  =  Yes 
N  =  No 
A( 1 )  Y  =  Yes 
N  =  No 

1(1)  1  =  Unchecked  from  GSFC 

2  =  Unchecked  from  UM 
5  =  Hand  checked 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Component  Summary  File  (CSF) 


Field 

Description 

Format 

Comments 

1-  6 

Form  Number 

A(  6) 

Eg.  100633 

7-  8 

Project  Code 

1(2) 

9-  13 

Programmer  Filling  out  Form 

1(5) 

Programmer  Code 

14-  18 

Programmer  Implementing  Component 

1(5) 

Programmer  Code 

19-  24 

Form  Date 

1(6) 

YYMMDD 

25 

Form  Stage 

A  ( 1 ) 

N  =  New 

U  =  Under  development 

C  =  Complete 

26-  28 

Component  Code 

1(3) 

29 

Precision  of  Specification 

Kl) 

From  Encoding  Dictionary 

1  =  Very  Precise 

2  =  Precise 

3  =  Imprecise 

Blank  ■  no  response 

30 

Complexity 

A(  1) 

E  =  Easy 

M  =  Moderate 

H  =  Hard 

Blank  =  no  response 

31-  33 

Type  of  Software 

31(1) 

From  Encoding  Dictionary 

Up  to  3  responses 

1  =  I/O  processing 

2  =  Algorithmic 

3  =  Logic  control 

4  *  Systems  related 

5  =  Data/COMMON  block 

6  =  Other 

Blank  =  no  response 


34-  36 

Percent  Assignment  Statements 

1(3) 

37-  39 

Percent  Control  Statements 

1(3) 

40-  42 

Percent  Other  Statements 

1(3) 

43-  47 

Number  of  Stmts  w/o  Comments 

1(5) 

48-  52 

Number  of  Stmts  w /  Comments 

1(5) 

53-  57 

Number  of  Machine  Bytes 

1(5) 

58 

Independent  of  Other  Software  Flag 

A(  1 ) 

Y  =  Yes 

N  =  No 

Blank  =  no  response 

59 

Relation  to  Other  Software 

1(1) 

From  Encoding  Dictionary 

1  =  Inserted  at  lower  level 

2  =  New  driver  or  interface 

3  =  Redesi  'n  existing  comps 

4  =  Rename  existing  comps. 

5  =  Regroup  exist,  material 

6  =  Other 

Blank  =  no  response 

60-  63  Type  of  Addition  to  Project  4A(1)  From  Encoding  Dictionary 

Up  to  4  responses 

1  =  Error  correction 

2  =  Planned  enhancement 

3  =  Implement  Req.  change 

4  =  Improve  clarity 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Component  Summary  File  (CSF)  continued 

Field  Description  Format  Comments 

60-  63  Type  of  Addition  to  Project  continued  4A(1)  5  =  Improve  'jse-  service 

6  =  Develop  u-ility  only 

7  =  Optimization 

8  =  Adapt  to  envir  change 

9  =  Other 

Blank  =  no  response 


64-  65 

Number  of  Components  Called 

1(2) 

66 

Filler 

67-  68 

Number  of  Calling  Components 

1(2) 

69 

Filler 

70-  71 

Number  of  Shared  Items 

1(2) 

72 

Filler 

73-  74 

Number  of  Components  Descending 

1(2) 

75 

Filler 

76-  77 

Primary  Language  Used 

1(2) 

From  Encoding  Dictionary 
1  =  FORTRAN 

2  =  Assembly 

Blank  =  no  response 

78-  80 

Percent  of  Primary  Language 

1(3) 

81-  82 

Secondary  Language  Used 

1(2) 

From  Encoding  Dictionary 

1  =  FORTRAN 

2  =  Assembly 

Blank  *  no  response 

83-  85 

Percent  of  Secondary  Language 

1(3) 

FORM  OF  DESIGN 

86-  87 

Levels  Using  Functional  Design 

21(1) 

Up  to  2  responses  each 

88-  89 

Levels  using  Procedural  Design 

21(1) 

1  =  Component  level 

90-  91 

Levels  Using  English  Design 

21(1) 

2  =  Subcomponent  level 

92-  93 

Levels  Using  Formal  Design 

21(1) 

3  =  Sasic  block  segment 

94-  95 

Levels  Using  Other  Design  Form 

21(1) 

4  =  Statement  level 

5  =  Other 

Blank  =  no  response 

CONSTRAINTS 

96-  97 

Memory  Space  Constraint 

A(2) 

Responses  for  each 

98-  99 

Execution  Time  Constraint  Flag 

A(2) 

X  =  Yes 

100-101 

Other  Constraint  Flag 

A(2) 

Blank  =  No 

102-104 

Number  of  Design  Runs  on  Computer 

1(3) 

105-107 

Number  of  Code  Runs  on  Computer 

1(3) 

108-110 

Number  of  Test  Runs  on  Computer 

1(3) 

111-113 

Computer  Time  for  Design 

F  ( 3  - 1 ) 

Tenths  of  CPU  minutes 

114-116 

Computer  Time  for  Code 

F  ( 3 . 1 ) 

Tenths  of  CPU  minutes 

117-119 

Computer  Time  for  Test 

F  ( 3 . 1 ) 

Tenths  of  CPU  minutes 

120-122 

Effort  for  Design 

F  ( 3 . 1 ) 

Tenths  of  manhours 

123-125 

Effort  for  Code 

F(3.1) 

Tenths  of  minutes 

126-128 

Effort  for  Test 

F  ( 3 . 1 ) 

Tenths  of  manhours 

129-134 

Estimated  Design  Phase  End  Date 

1(6) 

YYMMDD 

135-140 

Estimated  Coding  Phase  End  Date 

1(6) 

YYMMDD 

141-146 

Estimated  Testing  Phase  End  Date 

1(6) 

YYMMDD 

147 

Description  Comment  Flag 

A(  1 ) 

Y  =  Yes 

Record  Format 

Nasa-SEL  Software  Engineering  Dataset 
Component  Summary  File  (CSF)  continued 


Field 

Description 

Format 

Comments 

148-162 

Components  Called 

51(3) 

Up  to  5  component  codes 

163-177 

Calling  Components 

51(3) 

Up  to  5  component  codes 

178-192 

Shared  Components  (Items) 

51(3) 

Up  to  5  component  codes 

193-207 

Components  affected  by  Reorganization 

51(3) 

Up  to  5  component  codes 

208-227 

Other  Form  of  Design  Name 

A  ( 20 ) 

228-247 

Other  Constraint  Name 

A(20) 

248 

Useful  Items  Comment  Flag 

A(l) 

Y  =  Yes 

N  =  No 

249 

Additional  Comment  Flag 

A(  1) 

Y  =  Yes 

N  =  No 

250 

Status  Flag 

A  ( 1 ) 

1  =  Unchecked  from  GSFC 

2  =  Unchecked  from  UM 
5  =  Hand  checked 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Component  Status  Report  File  (CSR) 


Field 

Description 

Format 

Comments 

1-  6 

Form  Number 

A(6) 

Eg.  800952 

7-  8 

Sequence  Number 

1(2) 

9-  10 

Project  Code 

1(2) 

11-  15 

Programmer  Code 

1(5) 

16-  21 

Form  Date 

1(6) 

YYMMDD 

22-  24 

Component  Code 

1(3) 

25-  60 

Hours  Spent  in  Each  Activity 

9F (4.1) 

Up  to  9  responses  in 
tenths  of  manhours 
Activi  ties: 

1  Design  creation 

2  Design  reading 

3  Design  review 

4  Code  development 

5  Code  readi ng 

6  Code  reviewing 

7  Module  testing 

8  Integration  testing 

9  Test  reviewing 

61-  68 

Other  Activity  Name 

A(  8) 

59-  72 

Hours  Spent  in  Other  Activity 

F  ( 4 . 1 ) 

Tenths  of  manhours 

73 

Status  Flag 

Kl) 

1  =  Unchecked  from  GSFC 

2  =  Unchecked  from  UM 

5  =  Hand  checked 

74 

Phase  Flag 

A(  1 ) 

R  =  Requirements 

D  =  Development 

M  =  Maintenance 

Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Encoding  Dictionary  File  (ENC) 


Fi  el  d 

Descri pti on 

Format 

Comments 

1-  3 

Code  Type 

1(3) 

Numeric  code  identifying 
the  category 

4-  8 

Code 

A  ( 5 ) 

Alphanumeric  code  identi 
a  particular  value 

9-  16 

Abbreviation 

A(8) 

Eg.  J CL ERROR 

17-  60 

Verbal  Description 

A  ( 44 ) 

Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Estimated  Statistics  File  (EST) 


Field  Description  Format 

1-  8  Filler 

9-  10  Project  Code  1(2) 

11-  14  Total  Number  of  Components  1(4) 

15-  18  Total  Number  of  Modules  1(4) 

19-  22  Number  of  New  Modules  1(4) 

23-  26  Number  of  Modified  Modules  1(4) 

27-  32  Number  of  Computer  Runs  1(6) 

33-  38  Number  of  Source  Code  Changes  1(6) 

39-  44  Number  of  Pages  of  Documentation  1(6) 

45-  50  Total  Number  of  Lines  of  Source  Code  1(6) 

51-  56  Number  of  New  Source  Code  Lines  1(6) 

57-  62  Number  of  Modified  Source  Lines  1(6) 

63-  68  Total  Number  of  Executable  Statements  1(6) 

69-  74  Number  of  New  Executable  Statements  1(6) 

75-  80  Number  of  Modified  Executable  Stmts  1(6) 

81-  86  Programmer  Work  Hours  F ( 6 . 1 ) 

87-  92  Management  Work  Hours  F( 6 - 1 ) 

93-  98  Other  (Services)  Work  Hours  F(6.1) 

99-104  IBM  360-95  Computer  Hours  F(6.1) 

105-110  IBM  360-75  Computer  Hours  F ( 6 - 1 ) 

111-116  Other  Computer  Hours  F ( 6 . 1 ) 

117  Status  Flag  1(1) 


118  Active  Flag  A(l) 


119  Project  Category  1(1) 


Comments 


In  tenths  of  manhours 
In  tenths  of  manhours 
In  tenths  of  manhours 
In  tenths  of  hours 
In  tenths  of  hours 
In  tenths  of  hours 

1  =  Unchecked 

2  3  Hand  checked 

3  *  Verified  by  application 
Y  =  Active 

N  *  Inactive 
Blank  =  no  response 

1  3  Attitude  oriented 

2  =  Orbit  oriented 

3  =  Scientific  oriented 

4  =  Database  oriented 

5  3  Tool 

6  3  Real  time 

7  =  Other 

Blank  3  No  response 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Phase  Date  File  (HDR) 


Field 

Description 

1-  8 

Filler 

9-  10 

Project  Code 

11-  12 

Development  Computer 

13-  14 

Target  Computer 

15 

Extent  of  Alien  Computer  Use 
PHASE  DATES 

16-  21 

Requirements  Definition 

Start 

22-  27 

Requi rements  Def i ni ti on 

End 

28-  33 

Oesign  Start 

34-  39 

Design  End 

40-  45 

Code  and  Test  Start 

46-  51 

Code  and  Test  End 

52-  57 

System  Test  Start 

58-  63 

System  Test  End 

64-  69 

Acceptance  Test  Start 

70-  75 

Acceptance  Test  End 

76-  81 

Cleanup  Start 

82-  87 

Cleanup  End 

88-  93 

Maintenance  Start 

94-  99 

Maintenance  End 

100-111 

Spares 

112 

Status  Flag 

Format  Comments. 


1(2) 

1(2)  1  =  IBM  360 

2  =  P DP  11/70 
Blank  =  no  response 
1(2)  1  *  IBM  360 

2  =  PDP  11/70 
Blank  =  no  response 
1(1)  To  be  added 

1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
1(6)  YYMMDD 
A(  12) 

Kl)  1  =  Unchecked  from  GSFC 
2  *  Unchecked  from  UM 
5  *  Hand  Checked 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Growth  History  File  (HIS) 


Field 

Description 

Format 

Comments 

1-  2 

Project  Code 

1(2) 

3-  8 

Date 

1(6) 

YYMMDD 

9-  14 

Number  Commented  Source  Lines 

to  Date  1(6) 

15-  17  Number  of  Modules  to  Date  1(3) 

18-  23  Number  of  Changes  to  Date  1(6) 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Run  Analysis  File  (RAF) 


Field 

Description 

Format 

Comments 

1-  6 

Form  Number 

A(6) 

Eg.  J 006 33 

7-  8 

Sequence  Number 

1(2) 

9-  10 

Project  Code 

1(2) 

11-  15 

Programmer  Code 

1(5) 

16-  21 

Run  Date 

1(6) 

YYMMDD 

22-  23 

Computer  Code 

1(2) 

To  be  added 

24 

Interactive  Flag 

A(  1) 

X  3  Interactive 

Blank  3  no  response 

25-  28 

Run  Purpose 

Kl) 

From  Encoding  Dictionary 
1  3  Uni t  test 

2 1 2  3 4 5 6 7 8 9  System  test 

3  =  Benchmark  test 

4  *  Maintenance  or  utility 

5  =  Compile,  Assm. ,  or  link 

6  =  Debug  run 

7  =  Other 

Blank  *  no  response 


30 

Number  of  Components 

1(2) 

May  be  more  than  5 

45 

Component  Codes 

51(3) 

Up  to  5  component  codes 

46 

First  Run  Flag 

A(1 ) 

X  3  First  run 

Blank  3  no  response 

47 

Run  Met  Objectives 

A  ( 1 ) 

Y  3  Yes 

N  3  No 

Blank  3  no  response 

51 

Run  Results 

4A(  1) 

From  Encoding  Dictionary 
Up  to  4  responses 

1  =  Good  run 

2  =  Submi t  error 

3  3  JCL  error 

4  =  Other  setup  error 

5  =  Hardware  error 

6  =  Software  error 

7  3  Compile  error 

8  3  Link  error 

9  3  Execution  error 

A  3  User  generated  message 
B  =  Ran  to  completion 
Blank  3  no  response 


52 

Comment  Indicator  Flag 

A(  1) 

Y  =  Yes 

N  3  No 

53 

Status  Flag 

Kl) 

1  3  Unchecked  from  GSFC 

2  3  Unchecked  from  UM 
5  =  Hand  checked 


Record  Format 

NASA-SEL  Software  Engineering  Dataset 
Resource  Summary  File  (RSF) 


Field 

Descri pti on 

Format 

Comments 

1-  6 

Form  Number 

A  ( 6 ) 

Eg.  C00633 

7-  8 

Sequence  Number 

1(2) 

9-  10 

Project  Code 

1(2) 

11 

Resource  Type 

A(  1 ) 

M  =  Manpower 

C  =  Computer 

0  =  Other  (Services) 

12-  16 

Resource  Code 

1(5) 

From  Encoding  Dictionary 
(Programmer  code, 
computer  code,  or 
services  code) 

17-  22 

Form  Date 

1(6) 

YYMMDD 

23-  25 

Percent  Management 

1(3) 

26-  31 

Beginning  Date  of  Data 

1(6) 

YYMMDD 

32-108 

Resources  Used 

111(3) ,F(4.1 ) 

Up  to  11  alternating 
fields  representing  number 
of  computer  runs,  number  of 
hours  in  tenths  of  hours 

109 

Status  Flag 

A(  1) 

1  =  Unchecked  from  GFSC 

2  =  Unchecked  from  UM 

5  =  Hand  checked 

110 

Phase  Flag 

A(  1) 

R  =  Requirements 

D  -  Development 

M  =  Maintenance 

Record  Format 

Independent  Verification  and  Validation  Dataset 
Project  Data  Record 


Field 

Description 

Format 

Comments 

1-  2 

Project  Code 

A(2) 

3-  4 

Project  Application 

A(  1) 

C  =  CCCI 

0  =  Other 

6 

Type  of  Development  Effort 

A(  1 ) 

I  *  Initial  Development 

M  =  Modification 

8 

Operating  Mode 

A  ( 1 ) 

R  =  Real-time 

N  =  Non  real-time 

B  3  Both 

10 

Security  Classification 

All) 

C  3  Classified 

U  3  Unclassified 

P  =  Partial  Classification 

12 

Language  Type 

A(l) 

H  3  High  order  language 

A  3  Assembly  language 

B  3  Both 

14-  15 

HOL  Source  Lines 

1(2) 

In  thousands 

17-  18 

Assembly  Source  Lines 

1(2) 

In  thousands 

20 

Programming  Practices 

A  ( 1 ) 

T  3  Traditional 

M  3  Modern 

22 

Use  of  Top  Down  Design  at  System 
Level 

A(  1) 

Y  =  Yes 
•  N  3  No 

X  3  Not  applicable 

24 

Use  of  Top  Down  Design  at  Program 
Level 

A(  1) 

Y  3  Yes 

N  3  No 

26 

Type  of  Program  Support  Library  Used 

A(l) 

F  3  Full 

B  3  Basic 

M  =  Manual 

N  3  None 

28 

Type  of  Structured  Code  Used 

A(  1) 

S  3  Simulated  Constructs 

P  =  Preprocessor 

D  3  Directly  Compilable 

N  =  Not  Used 

30 

Type  of  RADC  Standards  Used 

A  ( 1 ) 

S  3  Standard 

C  3  With  Code  reading 

W  3  With  Design/Code  reviews 
M  3  Not  used 

32 

Type  of  Programmer  Team  Used 

A  ( 1 ) 

F  3  Full  programmer  team 

M  3  Modified  programmer  team 
N  3  Not  used 

34-  37 

Development  Start  Date 

1(4) 

MMYY 

39-  42 

End  Requirements  Definition  Phase 

1(4) 

MMYY 

44-  47 

Start  Design  Phase 

1(4) 

MMYY 

49-  52 

End  Design  Phase 

1(4) 

MMYY 

54-  57 

Start  Coding  Phase 

1(4) 

MMYY 

59-  62 

End  Coding  Phase 

1(4) 

MMYY 

64-  67 

Start  Testing  Phase 

1(4) 

MMYY 

69-  72 

Development  End  Date 

1(4) 

MMYY 

D-l 


Record  Format 

Independent  Verification  and  Validation  Dataset 
Project  Data  Record  continued 


Field  Description 


Format  Comments 


74  Number  of  Errors  Reported  in  1(1) 

Operations 

76-  78  Reason  for  Maintenance  if  Performed  A(3)  Up  to  3  responses 

R  =  Requirements  Change 
E  =  Error  Correction 
0  =  Other 
N  =  None  Performad 

80  Contracter  /  Developer  Relationship  A(l)  G  =  Good 

F  =  Fair 
P  =  Poor 


Record  Format 

Independent  Verification  and  Validation  Dataset 
Anomaly  Data  Record  (ADR) 


Field 

Description 

Format 

Comments 

1-  2 

Project  Code 

A  ( 2 ) 

4-  7 

Anomaly  Report  Number 

A(  4) 

Left  justified 

9-  12 

Report  Part 

A(  4) 

Left  justified 

14-  19 

Report  Date 

1(6) 

MMDDYY 

21-  26 

Analyst  Codes 

31(2) 

Up  to  3  responses 

28-  29 

Anomaly  Location 

A(2) 

S  ~  System/Segment  Specs 

I  =  Interface  Specs. 

R  =  Requirements  Specs. 

D1  3  Pre-code  Design  Specs. 

C  =  Code 

D2  3  Post-code  Design  Specs. 

U  =  User  documentation 
X  =  Other 

32-  33  Anomaly  Category  A(2)  If  Location  3  S,I,or  R 

1  =  Incorrect  Requirements 

2  3  Inconsistent  Reqs. 

3  =  Incomplete  Requirements 
4A  3  Unclear  Requirements 

4B  3  Unfeasible  Reqs. 

4C  3  Extraneous  Reqs. 

5A  3  Standards  not  used 
5B  3  Configuration  mngt. 

5C  3  Other  presentation  prob. 
If  Location  3  D1  or  C 

1  =  Req. /Design  compliance 

2  =  Choice  of  algorithm 

3  =  Sequence  of  operations 

4  =  Data  definition 
5A  =  Initial ization 

5B  =  Addressing,  indexing 
5C  3  Flags 
50  3  Counters 

5E  3  Shared  memory  locations 
5F  3  Other  data  prob. 

6  3  Timing  /  Interruptibil ity 

7A  3  Input  handling 
7B  3  Output 

7C  3  Hardware  interface 
7D  3  External  SW  interface 
7E  3  Routine  interface 
8A  =  Extraneous  Design/Code 
8B  =  Program  error  handling 
8C  =  Other  design/code  prob. 

9A  =  Standards  not  used 
9B  3  Configuration  mngt. 

9C  =  Comments,  Annts. 

9D  =  Other  presentation  prob. 


Record  Format 

Independent  Verification  and  Validation  Dataset 
Anomaly  Data  Record  continued 


Field  Description 
32-  33  Anomaly  Category  continued 


35-  37  Special  Circumstances 


39-  42  Anomaly  Effect 


44-  46  Operational  Effect 

48  Anomaly  Severity 

50-  51  IV&V  Phase  when  Detected 


Format  Comments 

A(2)  If  Location  =  D2  or  U 

1  =  Incorrect  documentation 

2  =  Inconsistent  documentation 

3  =  Incomplete  documentation 

4  =  Other  content  prob. 

5A  =  Standards  not  used 
5B  =  Configuration  mngt. 

5C  =  Other  presentation  prob. 

If  Locaion  =  X 

1  =  Hardware  System  prob. 

2  =  Other  Documentation 

3  =  Unknown  origin 

4  =  Development  process 

5  =  Other  Prob. 

3A(1)  Up  to  3  responses 

P  =  Error  from  correction  of 
previosuly  reported  error 
D  =  Disagreement  among  materials 
with  none  clearly  wrong 
N  =  Non  optimal  decision 
L  =  Latent  error 
H  =  Holdover  from  previously 
IV&V  effort 

C  *  Copy  of  previous  anomaly 
X  =  Other 

4A( 1 )  Up  to  4  responses 

D  =  Development 
V  =  Verifiability 
0  =  Operations 
M  =  Maintainability 
E  =  Ease  of  use 
X  =  Other 

3A( 1)  Up  to  3  responses 
C  -  Correctness 
A  =  Accuracy  /  precision 
S  =  Securi ty 
E  =  Efficiency 
X  =  Other 
A( 1 )  H  =  High 

M  =  Medium 
L  =  Low 
U  =  Unknown 

2A(1)  Up  to  2  responses 

R  =  Requirements  Verification 
D  =  Design  Verification 
C  =  Code  Verification 
T  =  Testing  Verification 
W  =  Documentation  Verification 
X  =  Other 
Blank  =  Unknown 


Record  Format 

Independent  Verification  and  Validation  Dataset 
Anomaly  Data  Record  continued 


Field  Description 

53  Development  Phase  when  Detected 


55-  57  Methods  Used 


59-  64  Tools  Used 


66  Anomaly  Acceptance 


68-  69  Action  Taken 


Format  Comments 

A( 1 )  R  =  Requirements 
D  -  Design 

C  =  Coding  and  Checkout 
T  =  Testing 
P  =  Post-testing 
0  =  Operations 
X  =  Other 
Blank  =  Unknown 
3A(1)  Up  to  3  responses 
M  =  Manual  Analysis 
E  *  Execution  testing 
T  =  Tool  use 
Blank  =  Unknown 
2A( 3 )  Up  to  2  responses 
SIM  =  Simulation 
RTA  =  Real-time  Analyzer 
TED  =  Text  editor 
ERA  =  Extension  register 
analyzer 

MAP  =  Memory  analysis  program 
ICS  =  Interpretative  computer 
simulator 

MSS  =  Missile  system 
simulation 
CMP  =  Compiler 
ASM  -  Assembler 
Blank  =  Unknown  or  none 
A( 1 )  A  =  Accepted  as  written 
C  =  accepted  with  changes 
R  =  Rejected 

W  =  Withdrawn  or  superceded 
U  =  Unknown 
X  =  Other 

A(2)  F  =  Fixed  and  verified 

FW  =  Fixed  found  wrong 

FU  =  Fixed,  unchecked 

C  =  Negated  by  unrelated 

change 

D  =  Deferred 

DF  =  Deferred,fixed  later 

DN  =  Deferred,  not  fixed 

W  =  Work  around  used 

N  =  No  action  taken 

0  =  Still  open  '/j 

U  =  Unkown  ;j 

P  =  Partially  fixed  £< 


Record  Format  . 

Independent  Verification  and  Validation  Dataset 
Anomaly  Data  Record  continued 


Field  Description 


Format  Comments 


71-  73  Materials  Changed 


A ( 3 )  Up  to  3  responses 

S  =  System/subsystem  spec. 
I  =  Interface  Spec. 

R  =  Requirements  Spec. 

D  =  Design  Spec. 

C  =  Code 

U  =  User  documentation 
X  =  Other 
Blank  =  Unknown 
1(6)  MMDDYY 


75-  80  Resolution  Date 


APPENDIX  E 


RECORD  FORMATS  for  the  ARF  ERROR  DATASET 


Project  level  data  is  recorded  in  the  same  format  as  data  contained  in 
the  DACS  Productivity  Dataset.  See  Appendix  A  of  this  report  for  these 
formats. 


Error  Report  data  is  recorded  in  the  same  format  as  Chage  Report  data 
recorded  in  the  NASA/SWL  dataset.  See  the  record  formats  for  the  Change 
Report  File  in  Appendix  C  of  this  report  for  this  information 

Segment  or  Module  Descriptive  Data  is  recorded  in  the  following  formats: 


Location 

Parameter 

Format 

Comments 

1  -  2 

Project  Code 

1(2) 

3-5 

Component  Code 

1(3) 

001-253 

6-9 

Total  Statements  in  Segment 

1(4) 

10  -  12 

Number  of  Conments  in 

Segment 

1(3) 

13  -  15 

Number  of  Pre- Processor 
Statements  in  Segment 

1(3) 

16 

Subjective  Complexity 

A(l) 

E  -  Easy 

M  -  Moderate 


APPENDIX  F 


RECORD  FORMATS  for  the 
O&M  DATASET 


Field 

1-  2 
3-  7 
8-27 
28-29 
30-35 
36-50 
51-70 
71-75 
76-78 

79 

80 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Programmer  Experience  Profile  (PEP)  Record  1 


Description 
Record  Type 

Personnel  Identification  Code 

Filler 

Age 

Date  Profile  Completed 

Project  Name 

Job  Title 

Position 

Section  Name 

Years  of  College  Education 
Years  of  High  School  Education 


Format 

Comments 

A(2) 

"1A" 

A(  5) 

Uniquely  Assigned 

1(2) 

1(6) 

YYMMDD 

A(  15) 

A(  20) 
A(4) 

Eg.  GS12 

A(3) 

Eg.  ADQ 

Id) 

KD 

F-l 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Programmer  Experience  Profile  (PEP)  Record  2 


Field 

Description 

Format 

Comments 

1-  2 

Record  Type 

A(2) 

"IB" 

3-  4 

Year  Graduated 

1(2) 

YY  (High  School 
no  College) 

5-  7 

First  College  Degree 

A(3) 

Eg.  BS,  AAS 

8-  9 

Degree  Year 

1(2) 

YY 

10-13 

Major  Course  of  Study 

A(  4) 

14-16 

Second  College  Degree 

A(3) 

Eg.  MS,  MEE 

17-18 

Degree  Year 

1(2) 

YY 

19-22 

Major  Course  of  Study 

A(4) 

23-25 

Third  College  Degree 

A(3) 

Eg.  PHD,  MAT 

26-27 

Degree  Year 

1(2) 

YY 

28-31 

Major  Course  of  Study 

A(4) 

32-33 

Number  Computer  Science  Courses 

1(2) 

34-35 

Number  Computer  Science  Semester  Hours 

1(2) 

36-37 

Number  Computer  Science  Seminars 

1(2) 

38-39 

Number  Years  with  Computers 

1(2) 

40-41 

Percent  of  Years  Individual  Effort 

1(2) 

42-43 

Percent  of  Years  Team  Effort 

1(2) 

44-45 

Percent  of  Years  Supervisor 

1(2) 

46-47 

Number  Years  w/Structured  Programming 

1(2) 

To  nearest  year 

48-49 

Number  Years  w/P.DL 

1(2)  • 

To  nearest  year 

50-51 

Number  Years  w/HIPO  Charts 

1(2) 

To  nearest  year 

52-53 

Number  Years  w/Top  Down  Development 

1(2) 

To  nearest  year 

54-55 

Number  Years  w/PSL 

1(2) 

To  nearest  year 

56-57 

Number  Years  w/Precompi lers 

1(2) 

To  nearest  year 

58-59 

Number  Years  w/Chief  Programmer  Team 

1(2) 

To  nearest  year 

60-61 

Number  Years  w/Other  Techniques 

1(2) 

To  nearest  year 

62-63 

Years  Experience  with  JOVIAL 

1(2) 

To  nearest  year 

64-65 

Years  Experience  with  Assembler 

1(2) 

To  nearest  year 

66-67 

Years  Experience  with  FORTRAN 

1(2) 

To  nearest  year 

63-69 

Years  Experience  with  COBOL 

1(2) 

To  nearest  year 

70-71 

Years  Experience  with  ALGOL 

1(2) 

To  nearest  year 

72-73 

Years  Experience  with  PL/1 

1(2) 

To  nearest  year 

74-75 

Years  Experience  with  PASCAL 

1(2) 

To  nearest  year 

76-77 

Years  Experience  with  Other  Languages 

1(2) 

To  nearest  year 

Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Programmer  Experience  Profile  File  (PEP)  Record  3 


Field 

Description 

Format 

Comments 

1-  2 

Record  Type 

A(2) 

“1C" 

3-  4 

Number  Years  w/Business  Appl. 

1(2) 

To  nearest  year 

5-  6 

Number  Years  w/Scientif ic/Math  Appl. 

1(2) 

To  nearest  year 

7-  8 

Number  Years  w/Sys.  Prog.  Appl. 

1(2) 

To  nearest  year 

9-10 

Number  Years  w/Real-time  Appl. 

1(2) 

To  nearest  year 

11-12 

Number  Years  w/Database  Appl. 

1(2) 

To  nearest  year 

13-14 

Number  Years  w/Other  Appl. 

1(2) 

To  nearest  year 

15-21 

Name  of  Primary  Computer 

A(7) 

Eg.  IBM360 

22-28 

Name  of  Primary  Operating  System 

A(  7) 

Eg.  DOS 

29-30 

Experience  on  Primary  System 

1(2) 

To  nearest  year 

31-37 

Name  of  Secondary  Computer 

A(7) 

38-44 

Name  of  Secondary  Operating  System 

A(  7) 

45-46 

Experience  on  Secondary  System 

1(2) 

To  nearest  year 

47-53 

Name  of  Tertiary  Computer 

A(  7) 

54-60 

Name  of  Tertiary  Operating  System 

A(7) 

61-62 

Experience  on  Tertiary  System 

1(2) 

To  nearest  year 

Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Discrepancy  Report  History  File  (DRH)  Record  1 


Field 

Description 

Format 

Comments 

1-  2 

Record  Type 

A(2) 

"2A" 

3-  7 

Discrepancy  Report  (DR)  Number 

A(5) 

YNNNN 

Y  =  Last  Digit  of  Year 
NNNN  *  Sequence  Number 

9-11 

DR  Origin 

A(3) 

Eg.  7th 

13-16 

Local  DR  Number 

A(4) 

Eg.  B270 

18-57 

DR  Description 

A(40) 

59-65 

Date  DR  Received 

A(7) 

Eg.  10  Oct  79 

67-69 

Responsible  Section 

A(3) 

Eg.  TAC 

71 

DR  Priority  A(l) 

Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Discrepancy  Report  History  File  (DRH)  Record  2 

R  *  Routine 

U  =  Urgent 

E  *  Emergency 

Field 

Description 

Format 

Comments 

1-  2 

Record  Type 

A(2) 

"2B" 

3-  9 

Date  Analysis  Started 

A(7) 

Eg.  06  Nov  79 

11-17 

PMR/PCD/PDDR  Open  Date 

A(  7) 

Eg.  12  Dec  79 

19-57 

Remarks 

A(40) 

Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Discrepancy  Report  History  File  (DRH)  Record  3 


Field 

Description 

Format 

Comments 

1-  2 

Record  Type 

A(2) 

"2C" 

1 

3-17 

Fi 1 ler 

19-25 

Estimated  Completion  Date 

A(7) 

Eg- 

10  Feb  80 

27-33 

Scheduled  Version  Release 

A(  7) 

Eg. 

PTAC-DO 

39-41 

Scheduled  Operation  Date 

A(7) 

Eg. 

12  Apr  80 

43-49 

Date  Forwarded  to  NCCB 

A(7) 

Eg. 

18  Dec  79 

51-53 

NCCB  Action 

A(3) 

APP 

=  Approved 

RE  3 

=  Rejected 

DEF 

=  Deferred 

55-61 

Date  of  NCCB  Action 

A(7) 

Eg. 

31  Dec  79 

63-69 

Date  DR  Closed 

A(7) 

Eg. 

13  Apr  30 

F-4 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Maintenance  Activity  File  (MAF)  Record  1 


Field 

Description 

Format 

Comments 

1-  5 

Discrepancy  Report  (DR)  Number 

1(5) 

6-11 

Oate  DR  Submitted 

1(6) 

YYMMDD 

12-15 

CPC I  Affected 

A(4) 

16-19 

CPCG  Affected 

A(4) 

20-24 

CPC  Affected 

A(5) 

Blank  if  more  than  one 

25-26 

Primary  Maintenance  Activity 

A(2) 

El  =  Error  correction 

A1  =  Add  capability 

D1  *  Delete  capability 

01  3  Optimize/enhance 

27-28 

Secondary  Maitnenance  Activity 

A(2) 

E2  *  Error  Correction 

A2  3  Add  capability 

D2  3  Delete  capability 

02  3  Optimize/enhance 

81ank  3  not  needed 

29-30 

Version  Release  Affected 

A(2) 

31-32 

Precision  of  Change  Specification 

A(2) 

VP  *  Very  Precise 

P  =  Precise 

IM  =  Imprecise 

33 

Urgency  Code 

A(l) 

E  3  Emergency 

U  =  Urgent 

R  -  Routine 

34-35 

Complexity  of  Change 

A(2) 

VC  3  Very  complex 

C  3  Complex 

M  3  Medium  Complexity 

S  3  Simple 

VS  3  Very  Simple 

36-37 

Primary  Means  of  Detection 

A(2) 

1  Response  each  for  two  fie 
HP  3  Hand  processing 

38-39 

Secondary  Means  of  Detection 

A(2) 

PC  3  Personal  communication 
IL  3  Infinite  loop 

MC  3  Maintenance  Crosscheck 
IE  3  Interrupt  error 

10  3  Incorrect  output 

MO  3  Missing  output 

EM  3  Error  message 

CR  3  Code  review 

DR  3  Documentation  review 

SD  3  Special  debug  code 

OT  3  Other 

40-46 

Programmer  Code 

A(7) 

47-50 

Manhours  to  Implement  Change 

1(4) 

51-54 

Computer  Hours  for  Change 

F  ( 4 . 1 ) 

Tenths  of  CPU  Hours 

55-57 

Number  of  Segments  Affected 

1(3) 

Specifies  number  of  Type  2 
records  following 

58-63 

Sources  of  Error 

F-5 

3A(2) 

Up  to  three  responses 

MS  3  Specs.  Misinterpreted 
IS  3  Specs.  Incorrect 

NS  3  Specs.  Incomplete 

Field 

58-63 


64-69 


70-75 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Maintenance  Activity  File  (MAF)  Record  1  continued 


Description 


Format  Comments 


Sources  of  Error  continued 


Nature  of  Change 


3A(2) 


Type  of  New  Requirement  3A(2) 


SF  =  Function  not  Implementec 

SI  =  Interface  not  Implemente 

SO  =  Software  Interface 

HI  =  HW/SW  Interface 

OS  =  Operating  System 

LE  =  Logic  Error 

CE  *  Computational  Error 

DE  *  Data  I/O  Error 

DD  -  Data  Definition  Error 

CN  *  Cause  Not  Found 

10  =  I/O  software 

PM  =  Prior  Modification 

SS  =  Support  Software 

DS  =  Deck  Setup  Error 

OE  =  Operator  Error 

OT  *  Other 

Up  to  3  responses 

DO  =  Documentation 

FI  *  Fix  Instruction 

CC  *  Change  Constants 

ST  =  Structural  Change 

AL  »  Algorithmic 

OT  =  Other 

Up  to  3  responses 

MI  =  Mission  Changed 

EM  =  New  Engineering  Model 

SW  =  More  Efficient  Software 

HW  *  Change  in  Hardware 

SS  =  New  Support  Software 

OT  =  Other 


F-6 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Maintenance  Activity  File  (MAF)  Record  2 


Field 

Description 

Format 

Comments 

1-  5 

Discrepancy  Report  Number 

1(5) 

6 

Segment  Status 

A(l) 

A  *  Segment  being  added 

D  =  Segment  being  deleted 
C  =  Segment  being  changed 

7-46 

Name  of  Segment  Affected 

A(  35) 

F-7 


Field 

1-  4 
5-13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27-32 

33-35 

36-38 

39-41 

42-45 

46-51 

52-57 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
CPCG  Description  File  (CDF) 


Description 

CPC I  Name 
CPCG  Name 

Special  Display  constraint 
Detailed  Requirements  Definition 
Change  to  Operational  Requriements 
Real  Time  Operation 
CPU  Memory  Constraint 
CPU  Time  Constraint 
First  Software  Developed  on  CPU 
Developed  Concurrently  with  HW 
Time  Sharing  (vs  Batch) 

Developer  used  other  Facility 
Operational  Site  Development 
Not  Developed  on  Target  System 
Programmer  Access  to  Computer 

PSL  Management  Report  Date 
PSL  Library  Level 
Source  of  Data 

Number  of  Programs 
Number  of  Segments 
Number  of  Source  Lines 
Words  of  Object  Code 


Format  Comments 

A(4) 

A(9) 

A(  1 )  Y  =  Yes  N  =  No 

A(  1)  Y  3  Yes  N  =  No 

A(l)  Y  3  Yes  N  3  No 

A(  1)  Y  =  Yes  N  3  No 

A( 1 )  Y  =  Yes  N  3  No 

A(l)  Y  3  Yes  N  3  No 

A( 1)  Y  3  Yes  N  3  No 

A(  1)  Y  3  Yes  N  3  No 

A(l)  Y  3  Time  Sharing 
N  3  Batch 

A(l)  Y  3  Yes  N  3  No 

A(l)  Y  3  Yes  N  =  No 

A(  1)  Y  3  Yes  N  3  No 

A( 1)  Y  =  Direct  Access 

N  3  Indirect  Access 
1(6)  YYMMDD 

A(3) 

A(3)  SEG  3  PSL  Summary  by  Segment 

PRG  3  PSL  Summary  by  Program 

1(3) 

1(4) 

1(6) 

1(6) 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Program  Change  History  File  (PCH) 


Field 

Description 

Format 

Comments 

1-40 

Program  Long  name 

A(40) 

42-47 

Program  Short  name 

A(6) 

49-52 

Language 

A(  4) 

55-61 

Date  Program  was  Last  Changed 

A(8) 

YY/MM/DD 

64-71 

Time  Program  was  Last  Changed 

A(8) 

HH.MM.SS 

72-76 

Number  of  Segments  in  Program 

1(5) 

77-82 

Total  Size 

1(6) 

In  source  lines 

83-86 

Number  of  Stubs 

1(4) 

88-89 

Program  Version 

A(2) 

Maximum  of  all  segment  vers 

90-93 

Program  Edition 

1(4) 

Sum  of  segment  adit  ions 

94-98 

Program  Instance 

1(5) 

Incremental  for  each  compil 

101-108 

Date  Compiled 

A(8) 

YY/MM/DD 

110-117 

Time  Compiled 

A(8) 

HH.MM.SS 

118-123 

Object  Size 

1(6) 

In  decimal  words 

Record  Format 

PAVE  PAWS  Operations  and  Maintenance 
CPCG  Status  File  (CSF) 

Dataset 

Field 

Description 

Format 

Comments 

1-  4 

CPCG  Name 

A(4) 

5-10 

PSL  Management  Report  Date 

1(6) 

YYMMDD 

11-15 

Effective  Code  at  PSL  PRG  Level 

1(5) 

In  source 

lines 

16-17 

Highest  Version  at  PRG  Level 

A(2) 

18-22 

Effective  Code  at  PSL  CPT  Level 

1(5) 

In  source 

lines 

23-24 

Highest  Version  at  CPT  Level 

A(2) 

25-29 

Effective  Code  at  PSL  INT  Level 

1(5) 

In  source 

lines 

30-31 

Highest  Version  at  INT  Level 

A(2) 

32-36 

Effective  Code  at  PSL  FIX  Level 

1(5) 

In  source 

lines 

37-38 

Highest  Version  at  FIX  Level 

A(2) 

39-43 

Effective  Code  at  PSL  TST  Level 

1(5) 

In  source 

1  ines 

44-45 

Highest  Version  at  TST  Level 

A(2) 

46-50 

Effective  Code  at  PSL  FRZ  Level 

1(5) 

In  source 

lines 

51-52 

Highest  Version  at  FRZ  Level 

A(2) 

53-57 

Effective  Code  at  PSL  DEL  Level 

1(5) 

In  source 

lines 

58-59 

Highest  Version  at  DEL  Level 

A{2) 

60-64 

Durable  Code  at  PSL  PRG  Level 

1(5) 

In  source 

lines 

65-66 

Highest  Version  at  PRG  Level 

A(2) 

67-71 

Durable  Code  at  PSL  CPT  Level 

1(5) 

In  source 

lines 

72-73 

Highest  Version  at  CPT  Level 

A(2) 

74-78 

Durable  Code  at  PSL  INT  Level 

1(5) 

In  source 

1  ines 

79-30 

Highest  Version  at  INT  Level 

A(2) 

81-85 

Durable  Code  at  PSL  FIX  Level 

1(5) 

In  source 

1  ines 

36-37 

Highest  Version  at  FIX  Level 

A(2) 

33-92 

Durable  Code  at  PSL  TST  Level 

1(5) 

In  source 

lines 

93-94 

Highest  Version  at  TST  Level 

A(2) 

95-99 

Durable  Code  at  PSL  FRZ  Level 

1(5) 

In  source 

lines 

100-101 

Highest  Version  at  FRZ  Level 

A(2) 

102-106 

Durable  Code  at  PSL  DEL  Level 

1(5) 

In  source 

lines 

107-108 

Highest  Version  at  DEL  Level 

A(2) 

109-114 

Last  Change  at  PRG  Level 

1(6) 

YYMMDD 

115-120 

Last  Change  at  CPT  Level 

1(6) 

YYMMDD 

121-126 

Last  Change  at  INT  Level 

1(6) 

YYMMDD 

127-132 

La.st  Change  at  FIX  Level 

1(6) 

YYMMDD 

133-138 

Last  Change  at  TST  Level 

1(6) 

YYMMDD 

139-144 

Last  Change  at  FRZ  Level 

1(6) 

YYMMDD 

145-150 

Last  Change  at  DEL  Level 

1(6) 

YYMMDD 

-i  <ni  m  vi  O'  rH  in  10  • j  in  O  'f  CO  r--> 


Record  Format 

PAVE  PAWS  Operations  and  Maintenance  Dataset 
Segment  Change  History  File  (SCH) 

-  i  e 1 d  Description  Format  Comments 


-40  Segment  Longname 

-47  Segment  Shortname 

-52  Language 

-57  Segment  Type 

-66  Date  Segment  was  Created 

-70  Current  Source  Lines  in  Segment 

-74  Gross  Size  of  Segment 

-33  Date  Segment  Last  Changed 

-92  Time  Segment  Last  Changed 

-95  Segment  Version 

-99  Segment  Edition 

-103  Total  Changes  to  Segment 

-107  Changes  to  Current  Segment  Version 

-111  Gross  Size  of  Current  Version 

-118  Identification  of  Segment  Creater 

120  Special  Circumstances  Flag 

23-130  Identification  of  Person  Last  Changing 
Segment 


A(  40) 

A(6) 

A(4) 

A  ( 4 ) 
A(3) 
1(4) 
1(4) 
A(3) 
A(S) 
A(2) 
1(4) 
1(4) 
1(4) 
1(4) 
A(6) 
Ad) 
A(6) 


YY/MM/DD 

includes  deleted  lines 

YY/MM/DD 

HH.MM.SS 


includes  deleted  lines 


GLOSSARY  OF  DATA- ITEM  NAMES  IN  THE  BSDS  DATASET 


PROJ-ID 

PROJECT  IDENTIFICATION 

PROJ-VERSION 

PROJECT  VERSION 

PROJ-TYPE 

PROJECT  TYPE 

SYS-ID 

SYSTEM  IDENTIFICATION 

SYS-VERSION 

SYSTEM  VERSION 

SYS-TYPE 

SYSTEM  TYPE 

SSYS-ID 

SUBSYSTEM  OR  FUNCTIONAL  AREA  IDENTIFICATION 

SSYS-VERSION 

SUBSYSTEM  VERSION 

SSYS-TYPE 

SUBSYSTEM  TYPE 

MOD- ID 

MODULE  IDENTIFICATION 

MOD-VERSION 

MODULE  VERSION 

MOD-TYPE 

MODULE  TYPE 

COMP- ID 

COMPUTER  IDENTIFICATION 

COMP-OM 

COMPUTER  OPERATING  MC JE 

COMP -RATE 

COMPUTER  PROCESSING  RATE 

COMP-OS 

COMPUTER  OPERATING  SYSTEM  TYPE 

TECH-ID 

IDENTIFICATION  OF  THE  CONSTRUCTION  TECHNOLOGY 

COMPL-ID 

TYPE  OF  COMPLEXITY  MEASURE  USED 

COMPLEXITY 

THE  COMPLEXITY  MEASURE  VALUE 

CONST-TYPE 

CONSTITUENT  TYPE(EX.  JOVIAL, ASSEMBLY  LANGUAGE) 

NUM-OCCUR 

NUMBER  OF  OCCURRENCES  OF  CONSTITUENT  TYPE 

PHASE 

PHASE  IN  WHICH  ACTION  OCCURRED 

NUM-RUNS-TOT 

TOTAL  NUMBER  OF  RUNS 

TEST-PER 

THE  PERIOD  IN  WHICH  THE  TEST  WAS  PERFORMED 

NUM-RUNS-OK 

TOTAL  NUMBER  OF  CORRECT  RUNS 

AHRS-PER-TEST 

AVERAGE  NUMBER  OF  HOURS  PER  TEST 

TEST-ID 

TEST  IDENTIFICATION 

TEST-TYPE 

TYPE  OF  TEST 

DATE-RUN 

DATE  THE  TEST  WAS  RUN 

STRESS-TYPE 

TYPE  OF  STRESS  APPLIED 

STRESS-MEAS 

AMOUNT  OF  STRESS  APPLIED 

TEST-RESULT 

RESULT  OF  TEST 

G-l 


GLOSSARY  OF  DATA- ITEM  NAMES  IN  THE  3SDS  DATASET 


NUM-ERR 

SPR-NUM 

DATE-OPEN 

MOD-SOURCE 

ERR-CAT-TYPE 

ERROR-CAT 

SEV-TYPE 

SEVERITY 

TYPE-TERM 

HRS-TO-OISC 

WORK-CAT 

SMN-NUM 

MOD-CHANGED 

MOD-CH-VERS 

COR-TYPE 

COR-MECH 

ACT-CAT 

DATE-BEGUN 

DATE-CLOSE 

DAYS-OPEN 

HHRS-TO-FIX 

NUM-CHANGED 

CODE-CONT 

PROB-DESC 

CORR-DESC 

ERROR-OESC 


NUMBER  OF  ERRORS  DISCOVERED  PER  TEST 

SOFTWARE  PROBLEM  REPORT  NUMBER 

DATE  THE  PROBLEM  WAS  REPORTED 

THE  MODULE  ID  WHERE  THE  PROBLEM  WAS  MANIFESTED 

ERROR  CATEGORY  TYPE 

ERROR  CATEGORY  CODE 

SEVERITY  TYPE 

SEVERITY 

TYPE  OF  TERMINATION 
HOURS  TO  DISCOVERY 

THE  TYPE  OF  DEVELOPMENT  TASK  PERFORMED 

SOFTWARE  MODIFICATION  NOTICE  NUMBER 

THE  ID  OF  THE  CHANGED  MODULE 

THE  VERSION  OF  THE  CHANGED  MODULE 

CORRECTION  TYPE 

CORRECTION  MECHANISM 

THE  TYPE  OF  TEST  PERFORMED 

DATE  WHEN  PROBLEM  SOLUTION  WAS  INITIATED 

DATE  WHEN  PROBLEM  WAS  REPORTED  TO  BE  CLOSED 

NUMBER  OF  DAYS  BETWEEN  DATE  OPEN  AND  DATE  CLOSE 

HUNDRETHS  OF  HOURS  TO  FIX 

NUMBER  OF  SOURCE  STATEMENTS  CHANGED 

A  CODE  THAT  INDICATES  AN  SPR  DOCUMENTS  MORE  THAN  1  PROBLEM 
A  DESCRIPTION  OF  THE  PROBLEM 
A  DESCRIPTION  OF  THE  CORRECTION 
A  DESCRIPTION  OF  THE  ERROR 


SOURCE:  f  DUVALL- 79 


APPENDIX  H 


RECORD  FORMAT  for  the 
DACS  COMPOSITE  PRODUCTIVITY  DATASET 


Record  Format 

Composite  Productivity  Dataset 


Field 

Description 

Format 

Comments 

1-  4 

Project  Code 

1(4) 

5-  11 

Project  Size 

1(7) 

In  source  lines 

12-  16 

Project  Development  Effort 

1(5) 

In  manmonths 

17-  19 

Project  Schedule  Duration 

1(3) 

In  months 

20-  22 

Average  Staff  Size 

1(3) 

23-  27 

Productivity 

1(5) 

Lines  per  Manmonth 

28-  32 

Number  of  Documented  Changes 

1(5) 

33-  35 

Program  Design  Language  Usage 

1(3) 

Expressed  as  percent 

999  =  Data  no  recorded 

36-  38 

Structured  Code  Usage 

1(3) 

Expressed  as  percent 

999  =  Data  not  recorded 

39-  41 

Top  Down  Programming  Usage 

1(3) 

Expressed  as  percent 

999  =  Data  not  recorded 

42-  44 

Chief  Programmer  Team  Usage 

1(3) 

Expressed  as  percent 

999  *  Data  not  recorded 

i 

tn 

** 

Programmer  /  Librarian  Usage 

1(3) 

Expressed  as  percent 

999  =  Data  not  recorded 

48-  50 

Formal  Code  Review  Usage 

1(3) 

Expressed  as  percent 

999  *  Data  not  recorded 

51-  61 

Primary  Language 

A(  11) 

62 

Dataset  Origin  Code 

A(  1) 

R  »  DACS  Productivity  Dataset 
N  *  NASA-SEL  Software  Eng. 
Dataset 

V  =  im  Dataset 


